Boosting the Concordance Index for Survival Data – A Unified Framework To Derive and Evaluate Biomarker Combinations
Abstract
The development of molecular signatures for the prediction of time-to-event outcomes is a methodologically challenging task in bioinformatics and biostatistics. Although there are numerous approaches for the derivation of marker combinations and their evaluation, the underlying methodology often suffers from the problem that different optimization criteria are mixed during the feature selection, estimation and evaluation steps. This might result in marker combinations that are suboptimal regarding the evaluation criterion of interest. To address this issue, we propose a unified framework to derive and evaluate biomarker combinations. Our approach is based on the concordance index for time-to-event data, which is a non-parametric measure to quantify the discriminatory power of a prediction rule. Specifically, we propose a gradient boosting algorithm that results in linear biomarker combinations that are optimal with respect to a smoothed version of the concordance index. We investigate the performance of our algorithm in a large-scale simulation study and in two molecular data sets for the prediction of survival in breast cancer patients. Our numerical results show that the new approach is not only methodologically sound but can also lead to a higher discriminatory power than traditional approaches for the derivation of gene signatures.
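To make the evaluation criterion concrete, the following is a minimal sketch of a Harrell-type concordance index and of one possible smoothed surrogate. The abstract does not state which smoothing is used, so the sigmoid approximation and its bandwidth `sigma` are assumptions made for illustration. The function names `concordance_index` and `smoothed_concordance` are hypothetical, and the sketch ignores any censoring weights a full estimator might apply.

```python
import math


def concordance_index(times, events, scores):
    """Fraction of comparable pairs that the risk scores order correctly.

    A pair (i, j) is comparable when subject i has an observed event
    (events[i] is truthy) and times[i] < times[j]. The pair is concordant
    when the subject who fails earlier has the higher risk score.
    Tied scores count as half a concordant pair.
    """
    concordant = 0.0
    comparable = 0
    n = len(times)
    for i in range(n):
        if not events[i]:
            continue  # censored subjects cannot anchor a comparable pair
        for j in range(n):
            if times[i] < times[j]:
                comparable += 1
                if scores[i] > scores[j]:
                    concordant += 1.0
                elif scores[i] == scores[j]:
                    concordant += 0.5
    return concordant / comparable


def smoothed_concordance(times, events, scores, sigma=0.1):
    """Differentiable surrogate: the indicator scores[i] > scores[j] is
    replaced by a sigmoid of the score difference (an assumed smoothing)."""
    total = 0.0
    comparable = 0
    n = len(times)
    for i in range(n):
        if not events[i]:
            continue
        for j in range(n):
            if times[i] < times[j]:
                comparable += 1
                diff = (scores[i] - scores[j]) / sigma
                total += 1.0 / (1.0 + math.exp(-diff))
    return total / comparable


# Toy data: four subjects, the third one is censored.
times = [2, 4, 6, 8]
events = [1, 1, 0, 1]
scores = [0.9, 0.3, 0.5, 0.1]  # higher score = higher predicted risk

c_hard = concordance_index(times, events, scores)
c_soft = smoothed_concordance(times, events, scores, sigma=0.1)
```

Because the smoothed version is differentiable in the scores, its gradient can serve as the working response in a gradient boosting loop, which is the general idea the abstract describes. A smaller `sigma` brings the surrogate closer to the unsmoothed index but makes the gradient steeper.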
Similar resources
A Gradient Boosting Algorithm for Survival Analysis via Direct Optimization of Concordance Index
Survival analysis focuses on modeling and predicting the time to an event of interest. Many statistical models have been proposed for survival analysis. They often impose strong assumptions on hazard functions, which describe how the risk of an event changes over time depending on covariates associated with each individual. In particular, the prevalent proportional hazards model assumes that co...
Comparison of Artificial Neural Network and Parametric Regression Models for Predicting Survival of Patients with Gastric Cancer
Background & Objective: Using parametric models is a common approach in survival analysis. In recent years, artificial neural network (ANN) models have increasingly been used in survival prediction. The aim of this study was to predict the survival rate of patients with gastric cancer using parametric regression and ANN models and to compare these methods. Methods: We used the data of 436 gast...
Joint Modeling of Survival and Longitudinal Data and Its Application to Investigating Factors Affecting Acute Kidney Injury
Background: In many clinical trials and medical studies, the survival and longitudinal data are collected simultaneously. When these two outcomes are measured from each subject and the survival variable depends on a longitudinal biomarker, using joint modelling of survival and longitudinal outcomes is a proper choice for analyzing the available data. Methods: In this retrospective archiv...
Increasing the accuracy of the classification of diabetic patients in terms of functional limitation using linear and nonlinear combinations of biomarkers: Ramp AUC method
The area under the ROC curve (AUC) is a common index for evaluating the classification ability of biomarkers. In practice, a single biomarker has limited classification ability, so to improve classification performance, we are interested in combining biomarkers linearly and nonlinearly. In this study, while introducing various types of loss functions, the Ramp AUC method and some of...
A Hybrid Framework for Building an Efficient Incremental Intrusion Detection System
In this paper, a boosting-based incremental hybrid intrusion detection system is introduced. This system combines incremental misuse detection and incremental anomaly detection. We use a boosting ensemble of weak classifiers to implement the misuse intrusion detection system. It can identify new classes of intrusions that do not exist in the training dataset for incremental misuse detection. As...